Decoupled Vector Architectures: a Rst Look
نویسندگان
چکیده
The purpose of this paper is to show that using decoupling techniques in a vector processor, the performance of vector programs can be greatly improved. We will show how, even for an ideal memory system with no latency, decoupling provides a signiicant advantage over standard mode of operation. We will also present data showing that for more realistic latencies, decoupled vector architectures perform substantially better than non-decoupled vector architectures. We will also introduce a bypassing technique between the queues and show how it can reduce the total memory traac. A side eeect of the decoupling technique presented is that it tolerates so well long memory latencies that could make feasible to use very slow DRAM parts in vector computers in order to reduce cost.
منابع مشابه
Performance of the decoupled ACRI-1 architecture: the perfect club
This paper examines the performance potential of decoupled computer architectures on real-world codes, and includes the rst performance bounds calculations to be published for the highly-decoupled ACRI-1 computer architecture. It also constitutes the rst published work to report on the eeectiveness of a decoupling Fortran90 compiler. Decoupling is an architectural optimisation which ooers very ...
متن کاملEffective usage of vector registers in decoupled vector architectures
Thz.spaptr presemts a study of the tmpact ofreduclng the vector regtsterstze m a decoupled vector architecture. In traditional in-order vector architectures, loltqvectorr egzstersh avetypically been the norm. We start presenting data that shows that, even for highly ucctorz.~able codes, only a small, fraction ojall elements of a long vector regzster are actually used. Lfre also show that reduct...
متن کاملSpeculative Parallelization in Decoupled Look-ahead Architectures
One well known approach to mitigate the impact of branch mispredictions and cache misses is to enable deep lookahead so as to overlap instruction and data supply with instruction processing. A continuous look-ahead process which uses separate thread of control on another hardware contexts is one such approach which we call decoupled look-ahead [1], [2]. However, in such look-ahead schemes, look...
متن کاملMemory Decoupled Architectures and related issues Guest Editor’s Introduction
It is my great pleasure to serve as guest editor for this special issue of TCCA Newsletter, which is hosting eight papers from the MEDEA (MEmory DEcoupled Architectures) Workshop, jointly held with PACT-2000 conference. The rationale behind this workshop was to revive the original idea of Memory Access Decoupling, presented in the famous paper of Jim Smith, “Decoupled Access/Execute Architectur...
متن کاملDecoupled Architectures for Complexity-Effective General Purpose Processors
Decoupled architectures have previously been investigated in the context of high performance scientific computing. For general purpose computing, however, superscalar processors have proven to be flexible in providing high performance across a wide range of applications. To achieve this goal, these architectures have incorporated enormous amounts of complexity to obtain modest performance impro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995